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Abstract 

Spatial-temporal linear model and the corresponding likelihood-based statistical inference 
are important tools for the analysis of spatial-temporal lattice data. In this paper, we study 
the asymptotic properties of maximum likelihood estimates under a general asymptotic frame- 
work for spatial-temporal linear models. We propose mild regularity conditions on the spatial- 
temporal weight matrices and derive the asymptotic properties (consistency and asymptotic 
normality) of maximum likelihood estimates. A simulation study is conducted to examine the 
finite-sample properties of the maximum likelihood estimates. 

Keywords: Autoregressive models; increasing domain asymptotics; infill asymptotics; linear 
regression; spatial-temporal process 
1. Introduction 

Spatial-temporal linear models are important tools for the analysis of spatial-temporal lattice 



data and have been applied in a wide range of disciplines (see, e.g., Anselin, 2001 , Baltagi, 2005 



and Cressie, 1993). A spatial-temporal linear model relates the response variable of interest 
to covariates via a linear regression component and models the spatial-temporal dependence in 
data via a random error component that is assumed to be a zero-mean Gaussian process. In this 
paper, we focus on simultaneous-autoregressive (SAR) type of spatial-temporal linear models 
for random error components and study the asymptotic properties of statistical inference via a 
maximum likelihood method. 

For statistical inference of spatial-temporal linear models, maximum likelihood estimation is 



often adopted. For the spatial-only case, Mardia and Marshall (1984) established that the max- 



imum likelihood estimates (MLE) of the parameters are consistent and asymptotically normal 



as the sample size tends to infinity for general spatial linear models. Lee (2004) studied the 



asymptotic properties of quasi-maximum likelihood estimators (QMLE) for a lag SAR model. 



Recently, Zheng and Zhu (2011) considered an error SAR model with general neighborhood 
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structures and explored the asymptotic properties of MLEs under a unified asymptotic frame- 



work. Robinson and Thawornkaiwong (2012 ) developed the asymptotic normality of ordinary 
least squares (LS) and instrumental variables (IV) estimates of linear and semiparametric partly 
linear regression models for spatial data and discussed the consistency of the estimates of the 
spatial covariance matrix. However, it is not clear how general the asymptotic framework is. 



For the spatial-temporal case, Yu et al. (2008 ) investigated the asymptotic properties of QM- 



LEs for spatial dynamic panel data with fixed effects and proposed a bias-adjusted estimator. 



Lee and Yu (2010a ) and Lee and Yu (2010b ) generalized the spatial dynamic panel data model 
to include both time and individual fixed effects and studied the asymptotic properties of QM- 
LEs. Here, we consider the asymptotic properties of maximum likelihood estimation for spatial- 
temporal linear models. Across space, we consider the three types of asymptotic frameworks 



defined in Zheng and Zhu (2011), namely increasing domain, infill and hybrid of increasing do- 
main and infill asymptotics. Over time, we assume that the number of time points tends to 
infinite as is traditionally done in time series, but will discuss the case with fixed time points. 

The remainder of the paper is organized as follows. In Section 2, we describe the spatial- 
temporal linear models. In Section 3, we develop maximum likelihood for inference and the 
main theoretical results are given in Section 4. A simulation study is performed in Section 5. 
Conclusions and discussions are given in Section 6. Technical proofs are shown in the Appendices 
A-B. 

2. Spatial- Temporal Linear Model 

Let Yu denote the response variable at site i and time t, where i = 1, . . . , n and t = 1, . . . , m. 
Let 

Y lt = X'a/3 + €«, (1) 

where Xu = (Xm, . . . , X p n )' is a vector of the covariates and f3 = (/?i, . . . , (3 p )' a vector of 
the regression coefficients. To formulate a spatial-temporal model for the random errors {en : 
i = l,...,n,t = l,...,m}, we focus on a SAR-type model and specify the model for e t = 
(eit, . . . , e n t)' in terms of the errors from time max{l, t — s} to t, 

min{s,t— 1} 

e t = Yl Qet-i + ut, (2) 

1=0 

where C\ = [c^]^- =1 , 1 = 0,... , min{s, t — 1}, s > 0, are spatial-temporal dependence matrices 
and Ut = {fit, . . . , v nt )' ~ A r (0, a 2 I n ) is a vector of white noises. Let Y t = (Yu, . . . , Y nt )' denote 
the n— dimensional vector of response variables on the entire spatial lattice for a given time 
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point t and Y nm = (Y{, ■ ■ ■ , Y^)' the nm— dimensional vector of response variables at m time 
points. Let e nm = (en, . . . , 

£ni? • • • > £imi • • • ■> Enm)' be a 7i?7i-dimensional vector of random errors 
and v nm = {yn, . . . , v-rii, ■ ■ ■ , vi m , ■ ■ ■ , v nm )' a nm-dimensional vector of white noises. We have, 
under Q and ©, 

Y nm X nm f3 -\- € nm , and tnm Cc nm + l^nmj 

where X nm is a nm x p design matrix, C is a lower-triangular matrix with Co as the diagonal 
blocks and Ci as the Ith. sub-diagonal blocks for I = 1, . . . , s. So the joint distribution of the 
response variable is 

Y nm ~ N (X nm f3,a 2 (I nm - C)~ 1 (I nm - C'y 1 ) . (3) 

where I nm is an nm x nm identity matrix. 

The model ([2]) is quite general and features a variety of neighborhood structures over space 
and time. Let M n {i) = {j : site j is a neighbor of site i} denote the neighborhood of site i. The 
neighborhood of site % can be further partitioned into q orders, such that N n {i) = U^ =1 A/" ni fc(i) 
where M n ,k{i) = {j '■ site j is a A;th order neighbor of site i}. Let W n fc = [w l ^\^j =l be a n x n 
spatial weight matrix with zero diagonal elements for all 1 < k < q. An example is binary 
spatial weights such that w'i = 1 if j G N n ^{i) and otherwise. Some special cases of model 
([2]) are as follows: 

• Spatial independence: s > 1, Co = and C = otiI n for Z = 1, . . . , s. 

• Temporal independence: s = and Co = Ylt=i ^JfcWn*- 

• Spatial-temporal separable neighborhood structure: s > 1, Co = Sfc=i @kW n k and C = 
a^I n for / = 1,... ,s. 

• Spatial-temporal non-separable neighborhood structure: s > 1, Co = Ylk=i^ k ^ nk anc ^ 
C = a,I n + £Li« fc forZ = l,..., S . 

Regularity conditions on the parameter space are needed to ensure that the model specified 
under ([T]) and ([2]) is valid. Since C is lower-triangular, a sufficient condition to ensure the non- 
singularity of Inm — 

C is that its diagonal blocks I n — Ylk=i ®kW n k are non-singular, where I n 
is an n x n identity matrix. For row standardized spatial weight matrices W n k, if Y2t=i ^ ^' 
then / n — X]fc=i ^Wnfe is nonsingular and thus the covariance matrix in (|3|) is positive definite 



(Corollary 5.6.16, Horn and Johnson, 1985 ). We let = (9i, . . . , 9 q )' G 0, where is a compact 
subset of W 1 . In the following, we will focus on a spatial-temporal linear model with a separable 
spatial-temporal neighborhood structure and s = 1. The results can be extended to general 
s > 1 readily. We let a G ^4 a , where is a compact set of (—1, 1). 



3. Maximum Likelihood Estimations 

Let T] = (/3', cr 2 )' denote the {p + (q + 1) + l}-dimensional vector of unknown parameters 
under the model specified in ([1]) and ([2]), where £ = (9', a)'. The log-likelihood function, up to 
a constant, is 

£( V ) = -(nm/2)loga 2 + log|S nm (£)| - (2o 2 y x v' nm v nm . (4) 

where S nrn {$) = I nm - C and u nm = S nm ($)(Y nm - X nm (3). The fist-order derivatives of l(r)) 
with respect to j3 and a 2 are, respectively, 

= (a 2 )-'X' nm S' nm (^ nm , = [2^)-\u' nm u nm - nma 2 ). 

By setting the score functions equal to zero, we obtain the maximum likelihood estimate (MLE) 
of (3 and a 2 , 

0nm{£) = {X nm S nm (£)S nm (£)X nm } X nm S nm (£)S nrn (£)Y nrn , 

^nm(^) = (nm) {Y nm — X nm /3 nm (^)} S nrn (£)S nm (£){Y nm — Jf nm /3 nm (^)}. 

We define a profile log-likelihood function of £ as 

m =^/W£U,£L(£)} = -(nm/2)log<7 nfn ^) 2 +log|5 nm (0l -nm/2. (5) 

Then the MLE of £ maximizes the profile log-likelihood £(£) and is denotes as l- nm . 

4. Asymptotic Properties 



In Zheng and Zhu (2011), three types of asymptotic frameworks are defined for spatial linear 
models on a lattice in terms of the volume of the spatial domain (i.e., the Lebesgue measure) 
and that of the individual cells. 

• Increasing domain asymptotics: The volume of the spatial lattice tends to infinity while 
the volume of each cell on the lattice is fixed. 

• Infill asymptotics: The volume of the spatial lattice is fixed while the volume of each cell 
on the lattice tends to zero. 

• Hybrid asymptotics (increasing domain combined with infill asymptotics): The volume of 
the spatial lattice tends to infinity and the volume of each cell on the lattice tends to zero. 

In this section, we study the asymptotic properties of the MLEs of the model parameters for the 
spatial-temporal linear model defined in (pQ) and ([2]). We consider all three types asymptotics in 
the spatial domain and assume that time tends to infinity. 



Let ijq = (/3(/, £o' , Co 2 )' denote the {p+(g+l) + l}-dimensional vector of true parameters and 
Sonm = S nm (£ Q ). The model evaluated at the true parameters r/ is Y nm = X nm f3 + S^m^onm- 
To establish the asymptotic properties of MLEs of the model parameters, we impose the following 
regularities. 

(A.l) The elements w l 't of the spatial weight matrix W n /% are at most of order uniformly 
for all j ^ i and w l ^ k = 0, for all i = 1, . . . , n and k = 1, . . . ,q and h n is bounded away 
from zero uniformly. 

(A. 2) The sequence of spatial weight matrices {W n k '■ k = 1, . . . , q} are uniformly bounded in 
matrix norms || • |^ and || • H^. For a matrix A = [(H,j]ij = i, the maximum column sum 
matrix norm || • ||i is defined by ||A||i = maxi<j< n E£=i \ a i,j\ an d the maximum row sum 



matrix norm || • | |oo is defined by | (Alloc = maxx<i< re Ej=i \ a i,j\ ( Horn and Johnson, 1985 >. 

(A. 3) The matrix S n (0) is nonsingular for 9 S and n E N, where S n (6) = I n — Y2k=i ^kWnk- 

(A. 4) The sequence of matrices {S" 1 (9)} is uniformly bounded in matrix norms 1 1 • | ^ and 1 1 • 1 1 
for 9 £ Q. The true parameter £o is i n the interior of S, where 5 = 9x A a . 

(A. 5) The elements of X nm are uniformly bounded constants. The limit of (rim)~ 1 Xj im Sj im (^)S nm (^)X r 
as nm — > oo exists and is nonsingular for £ G S. 

(A.6) For £ Co, lim nm ^ 00 / ln (nm)- 1 {logK 2 n(C)S; m (C)S„ m (^)| - log | 1} ± o, 

where <r%n(£) = (nm^afa {s , 0ra 1 n ^ m (C)S'„, m (C)S' 0n 1 m }. 

Assumptions (A.l) and (A. 2) are regularity conditions on the spatial weight matrices, where 
assumption (A. 2) is generally satisfied under row standardization. Here the order of the elements 
in the spatial weight matrices /i" 1 is an essential element in the specification of an asymptotic 
type in the spatial domain. We assume w^ k = 0{h~ l ), where h n could be bounded or tend to 
infinity. Consider an example with distance-based neighbors on a regular spatial lattice (Ex- 
ample 2 in |Zheng and Zhu, 201 ip . Let dij denote the Euclidean distance between sites % and 
j and aA = X{dij ~ (5k-i,Sk]} with prespecified threshold values 5q = < Si < ■ ■ ■ < S q . 
Then row standardized weight matrices based on A n & = [a^]™ J=1 are W n k with elements 
w nl = a nk/ EJ=i a nl For this example, h n = C(max{^™ =1 aj£ : k = 1, . . . , q, i = 1, . . . , n}), 
which is bounded under increasing domain asymptotics. Under infill asymptotics h n — y oo 
and h n /n does not tend to as n — > oo, whereas under hybrid asymptotics, h n — > oo and 
h n /n — > as n — > oo. Assumptions (A. 3) and (A. 4) are standard assumptions made about 
the sequence of matrices S n (9), where assumption (A. 4) is needed to ensure that the vari- 
ance of Y n is bounded. For row standardized spatial weight matrices W n k, if E|=i \@k\ < 1> 



then assumption (A. 3) and (A. 4) are satisfied (Corollary 5.6.16, Horn and Johnson, 1985). As- 
sumption (A. 5) is a standard assumption of the design matrix and implies that the elements 
of nm{X' nm S' nm ($,)S nm ($,)X nm }~ 1 are uniformly bounded. Assumption (A. 6) is needed to es- 



tablish identifiable uniqueness when establishing consistency of the MLE (see, e.g. Lee, 2004 



Zheng and Zhu, 2011[ ). For the case with spatial independence, it can be shown that as- 



^ for 



sumption (A.6) is simplified to linw^oo h n log (nm) 1 tr ^S' J n S' nm ($,)S nm ($)S 0l l m j 
£ £oj which is satisfied if h n converges to a non-zero point. 

Let = f3nm(inm),elm = °nm(£nm)> and f] nm = ((3' nm , $,' nm , 6% m )' be the MLE of rj. We 
have the asymptotic properties of the MLE of r\ as follows. 

Theorem 1. Assume that (A.1)-(A.6) hold and h n /{nm) — > as nm — > oo. Then the MLE of 

V 

r\ is consistent such that, as nm — > oo, f\ nm — > tjq. 

Theorem [1] gives the consistency of the MLE of model parameters. It shows that, under the 
regularity conditions, when the size of the spatial domain n — > oo, 

• if the number of time points m is fixed, then the MLE of r\ is consistent when either 
h n = 0(1) or h n — > oo but h n /n — > as n — > oo, which correspond to increasing domain 
asymptotics and hybrid asymptotics in the spatial domain, respectively. This result is the 



same as for the spatial-only case in Zheng and Zhu (2011 ). 



• if the number of time points m — > oo, the MLE of ij is consistent in cases (i) h n = 0(1); 
(ii) h n — > oo but h n /n — > 0; or (iii) h n — > oo but h n /n — > c G (0, oo], where the case (iii) 
corresponds to infill asymptotics in the spatial domain. 

When the size of spatial domain n is fixed, the MLE of rj is consistent only if m — > oo, which is 



consistent with results in Yu et al. (2008). 



Theorem 2. Assume that (A.1)-(A.6) hold. 

(i) If h n = 0(1) and the limit of — {nm)~ l E | g^g^/ 1 as nm — > oo exists and is positive 
definite for r) & W x H x M + , then the MLE of r] is asymptotic normal such that, as 
nm — > oo, 

(nm)^ 2 (f,nm-Vo) ^ N(0,-E m ) 
where E^ 1 = - linw^oo (nm)"^ j %^g°) j • 

(ii) If h n — > oo and h].^ s /(nm) — > for some 5 > as nm — > oo, and if the limit of 
—hninm^^E | *qqqq, | as nm — > oo exists and is positive definite for 6 £ G, then the 
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MLE of rj is asymptotically normal such that, as nm — > oo, 

(nm) l l 2 {p nm - A,) 4 ^(0,5^), H'/ 2 (4- a 2 )^iV(0,2^) 
(nm/^) 1 / 2 ^-^) 4 iV(O,S 0o ), H^i-a^^fO,^), 

w/iere = cj^ lim^^oo nm (X4 m 5o nm S , 0nm X nfn )~ 1 , S^ 1 = lim^,^ E j -h^nm)' 1 } 
and £ ao = lim nm ^ oo nm |tr(_ff 2 m + H nm H nm )^ . Here H nm — fnm^() nm i where F nm 
is a matrix with all elements equal to zero except that the first- order lower sub- diagonal 
blocks are I n . 

Theorem [2] is about the normality of the MLE of model parameters. Theorem [2] (i) shows that 

• if the sample size n — > oo and time points m — > oo, then the MLE of rj is asymptotically 
normal at a convergence rate of square root of nm when h n = 0(1). 

• if the time points m is fixed but n — > oo, the convergence rate is reduced to the square 
root of n as for the spatial-only case. 

• if the size of spatial lattice n is fixed but m — > oo, then the MLE of 77 is asymptotically 
normal at a convergence rate of square root of m, which is consistent with results in 



Yu et al. (2008). 



Theorem [2] (ii) requires that the size of spatial lattice n — > 00. It shows that 

• if m — > 00, when h n — > 00 but h n /n — > 0, or h n — > 00 but h n /n — > c € (0,oo], then 
the MLE of r\ is asymptotically normal with a convergence rate \Jnm for the regression 
coefficients /3, the temporal autoregressive coefficient a and the variance component a 2 



and a convergence rate ^/nmjh n for the spatial autoregressive coefficients 0. 



• if to is fixed, then similarly to the spatial-only case in Zheng and Zhu (2011), the MLE 
of rj is asymptotically normal with a reduced convergence rate y/n for the regression 
coefficients /3, the temporal autoregressive coefficient a and the variance component a 2 
and a convergence rate -sjnjh n for the spatial autoregressive coefficients 6 when h n — > 00 
and h n /n — > 0. 

The MLEs of model parameters can be obtained using the Newton-Raphson algorithm. For 
the large-sample case, according to the consistency of MLEs, under the assumptions in Theorem 
2 (i) we can estimate the covariance matrix by using 



(nm) 1 E 



(dH(rj)Y 


-1 


\ dridrj' J 





V=Vn 
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The detailed formats of the elements in — (nm) l E | g^^j | are given in Appendix B. The 
covariance structure under the assumptions in Theorem 2 (ii) can be estimated similarly. 

5. Simulation 

We now conduct a simulation study to examine the finite-sample properties of the MLEs under 
the three types of asymptotics across space and with an increasing number of time points. We 
consider an r x r square lattice with a unit resolution and m temporal points. We vary the 
number of time points by letting m = 2, 5 or 10. For each value of m, we vary the lattice size 
by letting r = 4 or 8. For each lattice size, we further divide each cell into an r* x r* sub-lattice 
and vary the sub-lattice size by letting r* = 1, 2, or 4. Thus, for each time point, the sample 
size n ranges from 16 (r = 4; r* = 1) to 1024 (r = 8; r* = 4). 

For a given lattice size r, sub-lattice size r* and temporal length m, we simulate data from 
the spatial-temporal model defined in ([I]) and ((2J). For the linear regression, we let E(Yn) = 
/3q + (3iXi, where = sin(z), (3q = 2, and /3± = 2 for the ith. cell, i = 1, . . . , n and t = 1, . . . ,m. 
For the spatial dependence, we consider distance-based neighborhood with order q = 1. We let 

= I{dij £ (0, 1]}, where dij denotes the Euclidean distance between sites i and j, and then 
define a row standardized weight matrix w 1 ^ = a^ x j Y^=i a ni- The parameter values are set at 
6\ = 0.8, q = 0.2, and a 2 = 1. For each simulated data, we estimate the model parameters by 
using the maximum likelihood method and obtain $o,Pi,6\, a, and a 2 . We repeat this procedure 
100 times. 

Tables [TJ [2] and [3] give the means and standard deviations of the MLEs. First, for a given 
number of time points m, we note that in general, the biases and standard deviations of all five 
parameter estimates decrease as the lattice size r increases from 4 to 8 for any given sub-lattice 
size r* or as both r and r* increase, which correspond to the increasing domain asymptotics 
and hybrid asymptotics across, respectively. Next, for a given number of time points m and 
given lattice size r, we consider the results over all sub-lattice size r*, which corresponds to 
infill asymptotics. In general, the biases and standard deviations of the regression coefficient 
estimates /3o, $i, the variance component estimate a 2 and the temporal autoregressive coefficient 
estimate a decrease as the sub-lattice size r* increases from 1 x 1 to 4 x 4. However, for the 
spatial autoregressive coefficient estimate 9, its biases and standard deviations remain similar as 
r* increases, which indicates that 9 is inconsistent under the infill asymptotics when the number 



of time points m is fixed. This result agrees with the spatial-only case in Zheng and Zhu (2011 ). 
Last, as the number of time points m increases from 2 to 10, we note that the overall performance 
of all five parameter estimates improves with either fixed or increasing lattice size r and sub- 
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lattice size r*. 



6. Conclusions and Discussion 

In this paper, we have studied the asymptotic properties of maximum likelihood estimates under 
a general asymptotic framework for spatial-temporal linear models. We have considered three 
types of asymptotics in the spatial domain and let the number of time points tend to infinity. 
Under mild regularity conditions on the spatial-temporal weight matrices, we have derived the 
asymptotic properties (consistency and asymptotic normality) of maximum likelihood estimates. 
The results can be easily extended to models with temporal lags s > 1. It is plausible that the 
asymptotics of MLEs for models with a general non-separable spatial-temporal neighborhood 
structure can be developed in a similar technique, which is currently under investigation. 

In our spatial-temporal autoregressive models, we assume that the errors are zero with e\ = 
at initial time points 1 — s < I < 0. An alternative way to formulate the process is to pre-specify a 
distribution for the errors at the initial time points. An analogy is an AR(1) model in time series 
et = P^t-i + vt, where v t ~ hd iV(0, a 2 ). It is conventional to let e\ ~ N(0, a 2 /(l — p 2 )) such that 
var(ef) = cr 2 /(l — p 2 ). However, this would be challenging for general spatial-temporal process. 
In addition, a spatial-temporal process can be proposed to condition on the initial s time points 
with e t = Ya=o C i e t~i + v u where u t = {fit, v n t)' ~ hd N(0, a 2 ) for t = s + 1, . . . , m. When 
m goes to infinity, MLEs will behave similarly to those under our model specification. 

Appendix A: Proof of Theorem 1 

From fl3J), we have that, under the true parameters 770, 

E{£(n)} = -(nm/2)lo g( r 2 +log|S nm (OI 

j'-l a> (t\.a (t\o-l \ 



-(2a 2 )" 1 [(A, - (3yX' nm S' nm (OS nm (OX nm ((3 -(3)+ a 2 tr {S^S'^Snm^S^ _ 

The first derivatives of Ei{rf) with respect to /3 and a 2 are, respectively, 

= {a 2 )- l ^ nm S' nm (i)X nm {f3, - (3), = (2a i r l {E{u' nm u nm ) - nma 2 }. 

Thus the maximizers of Et(rj) are 

Plmii) = A), and o* 2 m {£) = (n?n)-V 2 tr{5o^S; m (^)S nm (^)S 0n 1 m }. 

Let g nm (£) = E [e{/3* nm (t),Z, = -(nm/2) log< 2 m (^) + log \S nm (£)\ - nm/2. We 

establish the consistency of £ nm by showing that sup h n (nm)~ 1 \£(£) — g n m(€)\ = o p (l) and that 
/i„(nm) _1 9nm{$.) is identifiably unique (White, 1994). 
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To show sup h n {nm) — g n m(£)\ = °p(l)> we have 

(. -■ 

h n (nm)- l (Z{£) - g nm (£)) = (log<7 2 m (£) - log< 2 m (£)} = -M^te)} -1 {^(0 - 

where cr£ m (£) = Aa*^(£)+(l-A)6-;i m (£) for some A G (0, 1) ando^JO = (nm)-Vj nm B„(5)!/ 0l!m 
with 

Bnm(£) = SQ nm S nrn {^)\I nm — S nm (£)X nm {X nm S nm (£)S nm (£)X nm } X nm S nm (£)]S nrn (£) S 0nm . 
First, we have that h n {<5^ m (0 - ff'J,^)} = o p (l). By (A.1)-(A.4), we know 

h n (nm) 1 ^o nm S 0n l n S nm (^)S nm (^)S 0j ^ m Uo nm — E 1 I^Onm^Onm^nmCO^mCC^Onm^Onm} = °p(l)i 
as nm/h n —> oo, where the convergence is uniform on S because of the linear-quadratic form in 



£ and by corollary 2.2 of Newey (1991 ). 
By (A.1)-(A.5), 

h n (nm) 1 v 0nrn S 0nm S nm ($,) S nm ($,)X nm {X nm S nm (£)S nm (£)X nm } X nm S nm (£)S nm (£)S 0nm vo m 
= h n {nmy l |(nm) _1/2 X4 m 5^ m (^)5 nm (^)5 0n 1 m i/onm} {(nmy 1 X' nm S' nm ($,)S nm ($,)X nm y 1 
x | (nm)- 1/2 X' nm S' nm (£) S nm (£ )5 0n 1 m f 0nm } 

= Op(l) 

Again, the convergence is uniform on S, since (nm)" 1 X' nm S' nm {$,) S nm {£) SQ^ m UQ nm = o p (l) uni- 
formly on H, {(nm) _1 X4 m S^ m (£)iS' rem (£)X nm } _1 is uniformly bounded in l^, and the bound- 
edness is uniform on H by (A. 5). Thus it follows that, uniformly on S, 

(0} 

= h n (nm) [vb nm S 0nm S nm (£)S nm (£)S Qnm Vo nm — E{vQ nm S Q ^S nm (£)S nrn (£)SQn m VQ nm }} 

—h n (nm) 1 v 0nm S 0n l n S nm (£)S nm (£)X nrn {X nm S nm (£)S nm (£)X nin } 1 X nm S nm (£) S nm (£) S^^or, 
= o p (l) 

Next, we show the uniform boundedness of o^ m (£). By Jensen's inequality, 

{nm)~ 1 {g nm (£) - ff n m(£o)} = (™) 4 ('og |5„,m(^)| - log |S mn|) - 1 /2{ log cr* 2 ^) - logcr 2 } 

for £ G H. Under (A.1)-(A.4), 

(nm) _1 (log |S nm (£)| - log \S 0nm \) 



-(nm) 1 



tr{S-^(0diag{/ m ® W nl }, . . . ,tr{5"^(0diag{/ m <g> W nq }, tr{- S n l(i)A(a)} 



^oihn-^iOk-eok) (7) 



fc=i 
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where £ = A£ + (1 — A)£o f° r some A G (0, 1) and A(a) = aF nm . Thus, 

tog^nmte) = -1{nm)- l {g nm (£) - g n m{€o)} + 2(nm) _1 {log \S nm (£)\ - log \S 0nm \} + log a 2 
> 2 (ram) _1 {log |S„ m (£)| ~ log |Sown|} + logcr 2 

which is bounded from below uniformly on S, and (J*^ n {$) is bounded away from uniformly on 
3. Since o^ m (£) — <7*^j(£) = o p (l) uniformly on S, <r 2 m (£) is bounded away from in probability 
uniformly on E. Hence, sup /i n (nm) _1 |f(^) - g nm (£)\ = o p (l). 

To show the identifiable uniqueness of h n {nm)~ l g n m{$,) , we note that ft n (nm) _1 <7nm(£) is 
uniformly equicontinuous. In 

(6)} 

= ^H-Hlog |Snm(£l)| - log |S« m (6)} " /ln/2{log< 2 m (6) " log< 2 m (6)} 

= /i n (nm) _1 {log |S„ m (£i)| - log \S nm (&)} - h n {2af m )- 1 {a*n m {£i) - 

where a^ m = Acr* 2 n (^i) + (1 — A)cr* 2 n (^2) for some A G (0, 1) and is bounded away from 0, both 
terms are uniformly equicontinuous. Since by ((TJ), /i n (nm) _1 (log |<S nm (£i)| — log|S nm (^2)) = 
-0(1) ELl^fc ~ ^ofc) and with £ = A£i + (1 - A)£ 2 for some A G (0, 1), we have 

(£l) - 0"nm(6)} 

= -/t n (nm)~V 2 jNr |fl£^diag(I m ® W^)S n m(£)flw 

fc=i 

+4^ m (£)diag(I m ® ^jC} (0 lfe - 2fc ) + o%0(l)( ai - a 2 ) 
g 

= -a 2 O(l) J2(0lk ~ 02k) + o%0(l){ ai - a 2 ). 
fe=i 

Thus together with (A. 6) and ©, /i„(nm) _1 g n m{$,) is identifiably unique. Thus, the MLE of £ 
is a consistent estimator. 

The consistency of Si nrn {knm) can be derived directly from the consistency of <T n 2 m {knm) ■ 
Further, 

= A) + {X nm S nm (£ nm )S nm (£ nm )X nm } X nm S nm (£ nm ) S nm (£ nm ) SQ nm VQ nm 
q 

+ ^^($0fc ~~ ^nmfc){^nm'S'nm(^".m)'S'nm(^rim)^n.m} 3C nm S nm (£nm)diag( J m (g) W n fc)S l Q nm Z/0nm 

fe=l 

+{-^-nm^nm(5tim)Snm(^nm)-^nm} ^rim^^^Tim^i ' ' ' ' ^nm^n: 0) S 0nm l'o nrn 

where the last three terms are of order o p (l) by (A.1)-(A.5). 
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Appendix B: Proof of Theorem 2 

The case h n = 0(1) 

By (A. 4), £o is in the interior of 5. Thus, for sufficiently small e > 0, we have A e = {rj : 
\\rj — r}o\\ < e} C W x SxM + and P(fj n m £ A e ) — > 1 as n — > oo, where || • || denotes the Euclidean 
norm. We establish the asymptotic normality of the MLE by showing asymptotic normality of 
(nm)" l 9 y and convergence in probability of (nm)~ l d where f\ nm = At/ + (1 - A)?) nm 
for A 6 (0, 1) converges to rjo in probability. 

For convergence of (rtm)" 1 ^^ , we show that, under (A.1)-(A.5), N" 1 ! ^^ ~ 
= op(l) and nm-H^ 1 - E^^} = o p (l). Here a matrix is said to be O p (l) (or 
Op(l)) if all of its elements are of order O p (l) (or o p (l)). The second-order derivatives of l{rj) 
are 



d 2 £(r)) 
d(3d(3' 
d 2 £{rj) 
d(3da 2 
d 2 £{r]) 
d(3d9 k 
d 2 £(rj) 
d(3da 

where 



d 2 £(ri) 

da 2 da 2 
d 2 l(ri) 

dd k da 2 
d 2 £(rj) 

de k d9i 

d 2 £(rj) 
dada 2 
d 2 £(rj) 
dada 
d 2 £(r]) 

dadOk 



-{(J 2 ) 1 ~K nm S' nm (£)S nm (£)X. nm 
-(a 4 ) X. nm S nrn (^)i/ nm 

ia 2 r l X' nm {dmg(W' nk S n (e) + S' n (0)W nk , . . . , W nk S n (0) + S' n (e)W nk )}S~ l m (ti>nm, k 
-(a ) X nm {B nm B nm }S nm (£)t' nm , k = 1, . . . , q 



B, 



-al n S n (0) 



-al n S n (0) 



\ 



\ 















/ 



{2a & )' 1 {-2u' nm u nm + nma 2 ) 

-(^ 4 )" 1 ^ m diag(W nfc S- 1 (0), • • • , W nk S-\9))u nm 
-mtr{W nfc S- 1 (0)W n ^S- 1 (0)} - (a 2 )- 1 ^ m S / ^(^)diag(W;^ 

~( (J ) u nm^nmS nm (^)l'nm 

-(^ 2 )" 1 < m S^ m 1 (^)diag(I n , . . . , I n , 0)S^)v nm 



nki 



.,W' nl W nk )S-^)i 
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By (Al) - (A5), we have 



mm 



mm 



OpO) 



. u d 2 l{f, nm ) d 2 £( Vo ) 
1 d(3d(3' dpep 1 

{ — (<5"nm) \m^nm (0 ^™ (^)^nm + (Cq) \m^0nm^0iim.X n m} 



(7M7l) X n jSQ nm Sonm-^-nm( 9 ~2 ) ("""^imb) {■^■nm^Omn^Onm^mn ^-nm ^nm. (^nm ) Sum (^nm ) -^nm } 



a 2 



(nm) { 3/3&T 2 5/3&J 2 

(rem) 1 X nm S 0nm I^0nm( T " 

00 



} 



0" 



+ (rimer, 



nm I 



-lv' "p c — 1 



i^nrn) ^-nm^nrri{^.nm)S n m(^ nm )'X. n m{Po Pnm) 



o P (l) 



where 



E., 



( S'o n Son — S' n (6 nm )S n (0 nm ) O>Y^k=l(®0k — ^nfe)W n yfc 



\ 



11 k 



a YX=l(°Ok - QnkfWnk 
^On^Ort ~~ S n (0 nm )S n (0 nm ) J 



On 



(nm) {-(? nm ) X nm { J nm (^ nm ) + J nm (^ nrn )}S nm (^ nm )t / nm (^ nm ) + ((To ) X nm (Jonm + J()nm)S 
(77777) { — (cr nm ) X nm { J nm (^ nm ) + J nm (^ nm )}(Y nm — X nm /3o) 

+ (°"0 ) ^-Tim^Onm + Onm) nm ~ X nm /3o) — (<5"nm) ^rrniPimdnro) + Jnm(^m)}X nm (/3o — /3nm)} 



Op(l) 
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where J nm = diag(W; fe S n (0), • • • , W^S„(0)) 
f ,-ifd 2 £(fj nm ) d 2 £(rj ) 

(nm) { (& nm ) X nm {B nm (£nm) ^nm(^™)}(^imi ^-nmPnm) 
+(°"o 2 ) X^ m (Bonm + Bo nm )(Y nm — X nm /3o)} 
= (nm) 1 { — (cr^ m ) 1 X^ m {B nm (^ nm ) + B^ m (^ nm )}(Y nm — X nm /3o) 

+ (°"0 2 ) ^JjmtBonm + B' 0nm )(Y nm — X nm /3o) — (&nm) ^■'nmi^nmi^nm) + B^ m (£ nm )}X nm (/3o — /3 nm )} 

= (nm)- 1 [-mtr{W nfc S- 1 (0)W n/ S^ 1 (^)} + mtrfW^S^W^S^ 1 }] 

-(?im)- 1 ^ nm So nm " 1 diag(W^W nfc , • • • , W nl W nk )S^ m v 0nm (— - — ) 

-(?im)" 1 {(5-^ m )~ 1 (/3 - /3 nm )'X^ m diag(W^W nfc , • • • , W^W nA .)X nm (/3 - /3„ m ) 
-2(O _1 (0o - /3nm)X m diag(W;,W njfc; • • • , W^)S^ m i^ m } 

= Op(l) 

,_ u d 2 £(fj nm ) d 2 £(rj ) ^ 

= (nm) -1 {((°'o) _1 - (^m) _1 ) iy Onm dia g( w nifeSo n 1 , ■ ■ • (W^S^iw} 

-(nra) -1 ^)" 1 ^ - ^ nm )'X^ m diag(S^W nfc , • • • , S' n W nk )X nm (P'o ~ Pnm) 
-(nmy l (af im )~ 1 (p' - ^ rim ) / X^ m diag(S^W nfc So n 1 , • • • , S^W^S^ 1 )^ 
-(nm)~ 1 (^ m )~ 1 ^ nm diag(So- 1 S^W nfc , • • • , S^ 1 S^W nfe )X nm (/3o - /3 n 
-(nm)- 1 (^ m )- 1 ^ nm diag(S^ 1 (S n - S „)W rifc So n 1 , • • • , S^S* - S 0n )W nk S^)v 0ri 

= Op(l) 

, Wr d 2 l(r? nm ) 9 2 £(r 7o ) 1 
lnmj 1 C t 2 C t 2 0a W j 
= {(2^ m )- 1 - (2^)" 1 } + (nm)-^ nmIW (l/<7 6 " V*D 

— (nm) I^o nm S 0nrn {S 0nm Sonm ~ S nm (^nm)S nm (^ nm )}S 0nm Z/0nm 

_ t _ (Y wm X nm /3o) S nm (^ nm )S nm (^ nm )X nm (/3 nm /3o) 

""^(/^nm /^o) S nm (^ nm )S nm (^ nm ) ( Y nm X nm /3o) 

— (Pnm — Po)' 
= Op(l) 
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I! lit 



(nm) 1 {— {cr nm ) 1 ^ m (£nm)F nm S n ^(£ nm )i/ m71 + (<7q) 1 v 0nm F nm S n m L'onrn} 
(nm)- 1 {[(a^ m )~ l - (aQ)~ l ){Y nm - X nm /3 )'diag(a In, • • • , a I n , 0)(Y nm - X nm /3 )} 

0)X nm (/3 -/3 

+(A) - ^nm)X nm diag(a nm I n , • • • 

j OtnmXni 0) (Y nm X nm /3o) 

+(Y nm - X nm /3 )'diag((a nm - a )I n , ■ • ■ , («nm - «o)In, 0)(Y nm - X nm /3 )} 

Op(l) 

dada dada 

(nm) _1 {[(cr^) _1 - (<T^ m ) _1 ](Y nm - X nm /3 )'diag(I n , • • • ,I„,0)(Y nm -X nm /3 )} 

) {(Y nm - X nm /3 ) diag(I 0)X nm (/3 nm — /3o) 

+(Pnm - /3 )'X^ m diag(I 

ni " ' ; In> 0)(Y nm — X nm /3o) 

-(Aim - A))'X4 m diag(I 0)X n?n (/3 nfn — (3o)} 

o P (l) 



{nm) 9^r } = 0p(1) 



Thus, 



(nm) { 



drjdrj 1 drjdrj 



t\ = o P (l) 



Further, under (A.l) — (A.5),we have 



d 2 l(r?o) 
d(3d(3' 



nm) {- 



d(3da 2 



nm) {- 



d/3do k 



nm) {- 



d(3da 



nm) {- 



da 2 da 2 



,-lrd 2 l(V0 

nm) { 



E^M} 



d 2 i(y 
d(3d6 k 
d 2 £(y ) 
d(3da 

d 2 £(vo) 
da 2 da 2 
d 2 t(yo) 
dO k do 2 " dO k da 2 



d(5d(5 
d 2 £(rj t 



o, 



dfida 2 
'o 



■(nmJ'^ff^XmSonm^Onm = {nm)~ 1/2 X O p {\) = Op(l) 
■(H"Vo)~ 1X Iim( J Onm + ^Onm^Onm^Onm = O p (l) 

-(nm)~ 1 (cJo)~ 1 X^ m (Bonm. + B' 0nm )S^ m u 0nm = o p (l) 

-{nm)- l {al)~ l {u' 0nm v Qnm - nm) = o p (l) 

-(nm)~ 1 (a^)~ 1 {i/o„ m diag(W nfc So ri 1 , • • • ,W nk S^)u 0nm 
■(ao 2 ) x mtr(W nfe So n 1 )} = o p (l) 
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{nmyl{ mm " e ^m } = -(^)" 1 ^o 2 )~ 1 {^„ m s^dia g (w; / w rifc ,... ,w nl w nk )s^ m u 0nm 

-4 x tr{s^diag(w^w nfc> • • • , w^w nfe )s- 1 m }} = o p (l) 

(nm)_1{ S^ " E l^ ] = -M" 1 ^^^^ - a 2 x tr^S^}} = pp(l) 
(")"'{ ~ E Qad Q a } = -(^)" 1 (^o)~ 1 {^onmSo^diag(I n ,--- ,I„,0)So n 1 m i/ 0nm 

-0o x tr {So^diag(I n , • • • ,I n ,0)S^ m }} = o p {\) 

(nm) = ° 

Furthermore, the first-order derivatives of i^ij) at 770 are linear or quadratic forms of fonm since 

dl(?7o) _ , 2wl r / «/ S%o) _ /„ 4\-l/ / 2n 

TTg — l°0/ A mn' 5 0nm i/ 0nmi ~~da 2 ~ ~ \ U 0nm u 0nm ~ nm(T Q ) 

where G k = diag(J m <g> W nk S~ l (6 )) ,for k = 1, . . . , g. 
By (A.5), 

H"' /2 ^ ^ ^(0,^ liin (?im)- 1 (^)~ 1 X; m 5o nm 5 0nm X nm ). 



nm— > 00 



By a classic central limit theorem, 



( „ rar v2^)iU(o,(2,a)->; 

For asymptotic normality of (nm)" 1 '' 2 ^, (A. 2) and (A. 4) ensure that G k is uniformly 
bounded in matrix norm || • 1^ and || • and the positive definiteness of S~ Q ensures that 
(nm)" 1 Var(^^) = (nm)- 1 tr(G 2 + G' k G k ) is bounded away from 0. By a central limit theo- 



rem for linear-quadratic forms (Theorem 1, Kelejian and Prucha, 2001), we have 



(m) 4 * 3 N(0, lim (nm)-hr(Gl + G> k G k )) 

OO k nm— >oo 

for k = 1, ■ ■ ■ ,g. Similarly, we can get the asymptotic normality of (nm)~ 1 / 2 



-1/2 



^M^N(0, lim (nm)-hr(H 2 nm + H' nm H nm )) 



(nm) 

OCX nm->oo 

By Cramer- Wold Theorem and the fact that is asymptotically independent of ^gJL ^ 

and , for = 1, • • • , q, we have 
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where XL. 1 = lim„ ir ,._ s . tvl E 



[nm] 



-i d 2 £(yo) 
d(3d(3' 



E 



[nm] 



e( f^r 1 ^ 



(?i?ti(Tq) X nm S 0nm So nm X nrn , E y—(nm) 
(nm)~ 1 tr(G fc Gi + GjGfc), k,l = l,...,q 



(nmcl) 1 tv(G k ),k = l,...,q 



-l d 2 £( Vo ) 
df3d£> 



0, 



( —(nm) 



-i ggfog) 
dada 2 



(nm) hr(H 2 m + H' nm H, 
(nmo-o) _1 tr(iJ nm ), £ 



ran 



-i glfoo) 
d(a 2 ) 2 



4n-1 



Assumptions (A.1)-(A.5) ensure that all the elements in S 1 exist. Then it follows that 



(nm) 1/2 (f) ri 



■no) 



ran 



1/2 ^0?o) 2? 



5?7 



and thus the result of this theorem holds. 

In this theorem, we assume the existence and positive definiteness of the covariance matrix 
S^. Here we discuss a simple example about the validation of the existence of 5^0- Suppose 
we only consider the first-order spatial neighborhood, i.e. q = 1. Let r = (6', a 2 )'. Then 



■no 



lim 

nm->oo 



lim E I —(nm) 



(nma 2 ,) ^X' m Sn„^Sn nm X 



drjdrj' 

£ 











-(nm)^ 9 ^0) 

H"M fl L + H' nm H nm ) J 



lim (nm) 1 



(°"o) 1 -X-' nrn SQ nrn So nrn X nrrl 

tr(G? + GiGi) (^)-itr(G!) 

(a 2 )- 1 tr(Gi) nm(2(j 4 )- 1 









V tv(H 2 nm + H' nm H nm ), ) 

which is a block diagonal matrix. Assumption (A. 5) guarantees the existence and nonsingularity 
oflim nm _ >00 (nm(To)~ 1 X^ rnl S , o nm S'onm^nm.- Assumptions (A.2)-(A.4) ensure that lim nm _> 00 (nm)~ 1 tr(i?' 2 m + 
H' nm H nm ) is bounded and bounded away from zero. For the middle block, we have 

E (-H _1 |^) = (2nm4)~ 1 [tr(G 2 + G'.d) - 2(nm)" 1 tr 2 (G 1 )] 

= (4nm^)- 1 tr(G b G 6 ) = (Anmat)- l \\G b \\l > 0, 
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where G b = G\ + G[ — 2(nm)~ 1 ti(Gi) I nm and || • ||f denote the Frobenius norm. Hence, the 
matrix E ^ — (nm) ~ 1 ^ is positive definite in the large-sample case. Furthermore, assump- 

tion (A. 6) guarantees that it is positive definite in the limit (see, e.g. Lee, 2004). 

The case h n — > oo 

Now, we establish the asymptotic normality of the MLE 9 nm by showing the asymptotic 
normality of {^/(nm)} 1 ' 2 and ^ ne conver g ence i n probability of h n (nm) ~ l d g^gg? - , where 

£nm = A^o + (1 — A)£ nm for A G (0, 1) converges to £o in probability. The asymptotic normality 
of the MLE 

Qnm will be established by showing the asymptotic normality of (nm) — and 
the convergence in probability of (nm)" 1 ^?^. 

For convergence of h n (nm)~ l d ' qqq^ , we show that h n (nm) ~ 1 { 9 gjf d n ™ - ^}§§r } = o p (l) 
and h n (nm)~ l | 9 dew ~ ^ ^oDe^ } = °p(l)> un der (A.1)-(A.5). On the other hand, we have 
that (nm)-M^lH " = °p(1) and W" 1 " ^§^} = <*(!)■ Under(A.l)- 

(A.5), We note that 

-{^ m «)}-V 0wm aB r?^ ) ^ bwm - tr{(I m ® W^)} 



= {^nm^)} -1 ^^,!^)^ " tr{(I m ® W„i)S^(€)} 

-^-2- = 2{nm<T nm (£)} {iVm T nm ) l(£)*W} + {<J nm (€)} v Qnm — 1 

-tr{S^(0(Im ® W nl )S-i ® W„i)} 



II II! 



Tnm,l(£) = 

S n^{€) [Cm ® W„i)M nm (^)5 nm (^) - S^ m (£)(J m (8) W n i)X nm {X^ m 5^ m (^)5 nm (^)X nm }" 1 X^ m S'^ m (^)5, 



+ ^nm(0^nm(£)^nm{^nm^nm(0^nm(0^nm} X nm (I m ® W n i)S' nm (^)(^)^C rlm { X nm S nm (£) S nm (£) X. 

-X- nm S nm (£) S nm (£)~\ S nm (£) 

with M nm (£) = I nm - S nm (^)X nm {X; m 5; m (|)5 nm (^)X nm }- 1 x; m S; m (^). By(A.l)-(A.5) 

/i n (nm)" 1 i/Q nm T nmi i(^) o P (i) 
t / \— i / 9T nrn i(^) i 2 r <9T nm i(^) 

^Onm ^ VOnm = K(nm) CT tr{ — } + O p (l) 

Since T nr7lj i and — gg^ are uniformly bounded in either matrix norm || • ||i or || • ||oo> thus, 
under (A.l) - (A.5), 

d£ 2 (£) dT nm i(£) 

-Qf^T = 2 { nmd nrn{i)Y l { V Lrn^n m ,l(^0n m } 2 + {°nm(£)} "'onm ^ ^Onm 

-h n (nm)-hr{(I m ® W nl )S^(^)(I m ® W nl )S^(£)} 

= MH'M ^ 1 ^ } - M») 4 M(i m ® w rtl )s^(^)(i m w nl )s-^(0} + o p (i) 
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nmj 



and for £ nm = A£ + (1 - A)£ nm , 

^M^M 81 ^ } - tr{ aTn g ( ^ o) }} - ^N^H^ ® w n i)s-^(| nm )(i m ® w nl )s-^(| nm )} 

- tr{(I m ® W nl )So n 1 m (I m ® W nl )S 0n 1 m }] + 0p (l) 

= (€nm " 4o)'C(l) - (£ nm - £ yO(l) + Op(l) 
=Op(l) 



By similar argument for and 9k>,k,k' = 1, . . . , q, we have 

dOdO' dOdO 



Furthermore, 

-A»M -1 ^ = -^H"M ^# } + h n {nmyHr{G\) + o p (l) 

= -/t r i(nm) -1 {tr(G / 1 Gi) + tr(G 2 )} + o p (l) 

where Gi = (I m ® W n i)S "J ro , 

Also, we have ^(nm)- 1 !^© 1 " ^iS 1 } = °p(!)- Thus > ^H" 1 ^ 1 ~> ^ = 

To establish the asymptotic normality of |/i ra /(rem)} 1 ^ 2 ^ggr , we apply the central limit 



theorem for linear-quadratic forms in Appendix A of Lee (2004). Note that 



{h n /{nm)} 1 / 2 ^1 = {^/(nm)} 1 / 2 [{^(^jrVo^T^fe)^ - tr {(J m ® W„,i)^m}] 



{*L(0o)} _1 {KKnm)} 1 ' 2 {v'^G'^onm - ffgtr(Gi)} + o p (l) 



where 



Tnm, l(£o) 

= ^Onm [(-^"i ® ^ / nl)-^0nm'S l 0nm — S^ nrn (I m ® W n i) JC nm {-X^ nm S'g nm iS l 0n. m -^nm} -^-nm^Onm^Onm 
SQ nm SQ nm X nm {X nm SQ nm SQ nm X nm } X nm (I m ® VF ri i)iS'onm-^nm,{-^rim'^'Omn'^ l Onm-^n'm} 

With M 0nm = I nm - S nm^nm{^ m S'o nm 5onm^nm}~ 1 ^m' S 'onm- ( A - 2 ) and ( A - 4 ) ensure that 

Gi is uniformly bounded in matrix norms || • ||i and || • ||oo and the positive definiteness of X!^ 1 
ensures that {h n /(nm)}Yar(u' 0n Gii'o n ) = {h n / (nm)} o"otr(Gf + G[G\) is bounded away from 
zero. By Lee (2004), we have 

{h n /(nm)} X/2 {i^Giiw - ffgtr(Gi)} 4 N (o, lim / l „(nm)"V^{tr(G 1 G , 1 ) + tr(G 2 )} 
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Hence 



{Ji„/H} 1/2 ^ 4 iv(0 Jf Im h n (nm)- 1 {tr(G 1 Gi) + tr(G?)}) 
Then similarly for 9/., k = 2, . . . , q, we have 

{^n/H} 1/2 ¥ ^ ^(o, lim /i n (nm)~ 1 {tr(G fc G / fc ) + tr(G|,)}) • 
Thus, by the Cramer- Wold theorem, 

fu II ul/2^o) £> . r / n 
{An/M) —qq > N (°^0 n ) 



where 



J 



h n (nm)-Hr(Gl + G' 1 Gi) ... /i n (nm)- 1 tr(GiG g + G^Gi) 

/ in (nm)" 1 tr(G2 + G' q G q ) 



lim 

71— >00 



h n (nm)~ 1 tr(G q Gi + G[G q ) ... 
Similarly, we can show that 

(„ m )-i/2^o) 4 ^ / Q) Um (nm) -i tr{ff 2 +fl 7 and ^( nm )-i^^) = o (1 ). 

Thus it follows that 



-1 



h n (nm) ' 



dOda 



( nm )~ l 9 +°p( 1 ) 



dOdO' 

1,2 dm) 
da 



D 



and similarly, 

H 1/2 («L-a§)^(0,S ao ) 

with S Q0 = lim nr)WOO nm {tr(22£ m + H' nm H nm )} 1 . 

To establish the asymptotic normality of n m(,$nm) an d <3"„ m (£nm)> we have, under (A.l)- 
(A.5), 

(nm) 1/2 (/3 nm (^ nm ) - /3 ) 
= (nm) I |(nm) X nm S nm (£, nm )S nm (£ nrn ) X nm ^ X nm S nm (^ nm )S nm (^ nm )SQ nm UQ nrn 
= (nm) '' 2 ((nm) 1 X' nm S' 0nm So nrn X nrn ) X' nm S' 0nm u 0nm + o p (l) 
4 iv(o, lim (( nm) 1 X' nm S' 0nm So nrn X nrn ) 
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and 

H 1/2 (4(U) - Oq) = (nm)" 1 / 2 Knm^Onm - nmal) + o p {\) 4 iV(0, 2<rg). 
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Table 1: Means and standard deviations (SD) of maximum likelihood estimates (MLE) of the 
model parameters based on 100 simulated data. Here the lattice sizes are 4x4 and 8x8 with 
varying sub-lattice sizes lxl, 2x2 and 4x4 within each cell of the lattice, and the number of 
time points m is 2. 











4x4 






8x8 




Parameter 


Truth 


MLE 


1 x 1 


2x2 


4x4 


1 x 1 


2x2 


4x4 








(n = 16) 


(n = 64) 


(n = 256) 


(n = 64) 


(n = 256) 


(n = 1024) 




2.0 


Mean 


2.2553 


1.6990 


1.7385 


2.1934 


1.8016 


2.0995 






SD 


(0.2745) 


(0.0240) 


(0.0188) 


(0.0106) 


(0.0128) 


(0.0007) 


Pi 


2.0 


Mean 


1.8913 


2.2048 


2.1146 


2.1913 


2.1079 


2.1192 






SD 


(0.3293) 


(0.1026) 


(0.0622) 


(0.0143) 


(0.0546) 


(0.0254) 


a 2 


1.0 


Mean 


1.4461 


1.3757 


1.3426 


1.3532 


1.3213 


1.1849 






SD 


(0.3590) 


(0.1890) 


(0.0575) 


(0.1800) 


(0.0932) 


(0.0262) 


9 


0.8 


Mean 


0.9623 


0.7071 


0.7300 


0.6711 


0.7002 


0.9163 






SD 


(0.1953) 


(0.0692) 


(0.0528) 


(0.0252) 


(0.0369) 


(0.0118) 


a 


0.2 


Mean 


0.0225 


0.0649 


0.0787 


0.0503 


0.0820 


0.1008 






SD 


(0.1389) 


(0.0201) 


(0.0371) 


(0.0051) 


(0.0167) 


(0.0165) 
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Table 2: Means and standard deviations (SD) of maximum likelihood estimates (MLE) of the 
model parameters based on 100 simulated data. Here the lattice size are 4x4 and 8x8 with 
varying sub-lattice sizes lxl, 2x2 and 4x4 within each cell of the lattice, and the number of 
time points m is 5. 











4x4 






8x8 




Parameter 


Truth 


MLE 


1 x 1 


2x2 


4x4 


1 x 1 


2x2 


4x4 








(n = 16) 


(n = 64) 


(n = 256) 


(n = 64) 


(n = 256) 


(n = 1024) 




2.0 


Mean 


2.2430 


1.7536 


2.2494 


2.1473 


2.1481 


2.0992 






SD 


(0.0254) 


(0.0058) 


(0.0044) 


(0.0072) 


(0.0033) 


(0.0028) 


Pi 


2.0 


Mean 


1.7381 


2.1886 


2.1069 


2.1472 


2.0865 


2.0624 






SD 


(0.0616) 


(0.0191) 


(0.0188) 


(0.0167) 


(0.0220) 


(0.0157) 


a 2 


1.0 


Mean 


1.7640 


1.2659 


1.2123 


1.3342 


1.2075 


1.1496 






SD 


(0.5033) 


(0.0824) 


(0.0690) 


(0.1344) 


(0.0309) 


(0.0152) 


9 


0.8 


Mean 


0.7175 


0.7076 


0.7195 


0.6907 


0.7415 


0.7330 






SD 


(0.0627) 


(0.0246) 


(0.0258) 


(0.0354) 


(0.0192) 


(0.0171) 


a 


0.2 


Mean 


0.0536 


0.0774 


0.0902 


0.0804 


0.0937 


0.1131 






SD 


(0.1026) 


(0.0053) 


(0.0141) 


(0.0188) 


(0.0163) 


(0.0137) 
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Table 3: Means and standard deviations (SD) of maximum likelihood estimates (MLE) of the 
model parameters based on 100 simulated data. Here the lattice sizes are 4x4 and 8x8 with 
varying sub-lattice sizes lxl, 2x2 and 4x4 within each cell of the lattice, and the number of 
time points m is 10. 











4x4 






8x8 




Parameter 


Truth 


MLE 


1 x 1 


2x2 


4x4 


1 x 1 


2x2 


4x4 








(n = 16) 


(n = 64) 


(n = 256) 


(n = 64) 


(n = 256) 


(n = 1024) 




2.0 


Mean 


1.7480 


2.1839 


1.8039 


1.8889 


1.8924 


1.8988 






SD 


(0.0146) 


(0.0221) 


(0.0161) 


(0.0111) 


(0.0132) 


(0.0050) 


Pi 


2.0 


Mean 


1.8046 


2.1780 


1.9044 


2.1789 


2.1740 


2.0975 






SD 


(0.0686) 


(0.0319) 


(0.0092) 


(0.0240) 


(0.0446) 


(0.0105) 


a 2 


1.0 


Mean 


1.3016 


1.2165 


1.2162 


1.2812 


1.1589 


1.0664 






SD 


(0.1227) 


(0.1708) 


(0.0738) 


(0.1390) 


(0.1004) 


(0.0670) 


9 


0.8 


Mean 


0.7575 


0.7159 


0.7158 


0.7341 


0.7176 


0.7535 






SD 


(0.0339) 


(0.0254) 


(0.0212) 


(0.0321) 


(0.0303) 


(0.0143) 


a 


0.2 


Mean 


0.0654 


0.0781 


0.1146 


0.0973 


0.1331 


0.1445 






SD 


(0.0675) 


(0.0346) 


(0.0368) 


(0.0543) 


(0.0329) 


(0.0236) 
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